Speaker role based structural classification of broadcast news stories
نویسندگان
چکیده
This paper is concerned with automatic classification of broadcast news stories based on speaker roles such as anchor, reporter and others. The story classification is the first step for many related tasks such as browsing, indexing, and summarising the news broadcast. We use broadcast news audio and its automatic speech recogniser transcripts to implement the classification system. It builds on speaker segmentation and identification, story segmentation and named entity identification. It has achieved 92% accuracy when individual stories were provided manually. The performance declined to 67% and 51%, of precision and recall related measures respectively, when combined with automatic story boundary segmentation.
منابع مشابه
Initial Study on Automatic Identification of Speaker Role in Broadcast News Speech
Identifying a speaker’s role (anchor, reporter, or guest speaker) is important for finding the structural information in broadcast news speech. We present an HMM-based approach and a maximum entropy model for speaker role labeling using Mandarin broadcast news speech. The algorithms achieve classification accuracy of about 80% (compared to the baseline of around 50%) using the human transcripti...
متن کاملLook Who is Talking: Soundbite Speaker Name Recognition in Broadcast News Speech
Speaker name recognition plays an important role in many spoken language applications, such as rich transcription, information extraction, question answering, and opinion mining. In this paper, we developed an SVM-based classification framework to determine the speaker names for those included speech segments in broadcast news speech, called soundbites. We evaluated a variety of features with d...
متن کاملMulti-View Approach for Speaker Turn Role Labeling in TV Broadcast News Shows
Speaker role recognition in TV Broadcast News shows is addressed in this paper. Speaker turns are assigned a role among anchor, reporter and other. A multi-view approach is proposed exploiting the complementarities of lexical cues obtained from Automatic Speech Recognition output and acoustical cues obtained from speech signal analysis. Early and late fusions are compared. 90.1% classification ...
متن کاملBrowsing System
In this demo, we present a system we have developed for automatic broadcast-quality video indexing that successfully combines results from the fields of speaker verification, acoustic analysis, very large vocabulary caption character recognition, content based sampling of video, information retrieval, dialogue systems, and ASF media delivery over IP. The prototype system of this demo is availab...
متن کاملSpectral cross-correlation features for audio indexing of broadcast news and meetings
This paper describes the effect of three new acoustic feature parameters to detect audio source segments that are based on spectral cross-correlation: spectral stability, white noise similarity, and sound spectral shape. These parameters are devised for accurate audio source detection and are used in a pre-processing module for automatic indexing of the broadcast news and the meetings. We condu...
متن کامل